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Abstract 

D 

Ph We show how to reduce the problem of computing VaR and CVaR with Student T return 

qq distributions to evaluation of analytical functions of the moments. This allows an analysis of the 

£N) risk properties of systems to be carefully attributed between choices of risk function (e.g. VaR vs 

CVaR); choice of return distribution (power law tail vs Gaussian) and choice of event frequency, 
for risk assessment. We exploit this to provide a simple method for portfolio optimization 
<^ when the asset returns follow a standard multivariate T distribution. This may be used as a 

Oh semi-analytical verification tool for more general optimizers, and for practical assessment of the 

r*j impact of fat tails on asset allocation for shorter time horizons. 

Keywords: VaR, CVaR, Portfolio Optimization, VaR Optimization, CVaR Optimization, 
i— i Optimisation. 
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\q 1 Introduction 

\D 

The non-Gaussian nature of asset returns has been established for many years from many per- 
CN spectives. From the point of view of statistical moment studies, Fama [3] and Mandelbrot [7] 

demonstrated the excess kurtosis of asset returns in the 1960s. Studies published in 2003 [6] fo- 
cused on tail analysis, claiming that the tails of return distribution functions exhibit power- law 
behaviour, often with inverse cubic decay. Maximum Likelihood analysis has also been pursued. 
Fergusson and Platen [1] exhibited the importance of Student-T characteristics in daily log-returns 
of major indices. In the follow studies we will use the Student T as a model of asset returns, in 
that it can simultaneously reproduce excess kurtosis, power-law tails and be at or close to the MLE 
distributional estimate in a variety of situation^] This is not to argue that this distribution is a 
universal panacea. Rather, it serves three other purposes. First, its manifestly greater likelihood in 
matching observed returns suggest that it is useful model for exploring risk characteristics and asset 
allocation in its own right. Second, it serves as a useful semi-analytical benchmark for more gen- 
eral computational models of optimization, to ensure their quality on an otherwise hard-to-obtain 
problem. VaR/CVaR optimization has been pioneered by Rockafeller and Uryasev [11], and general 
risk measure optimization by Shaw |13j . Finally, the reduction of VaR and CVaR optimization to 
an essentially closed-form problem description, with an objective function that may be explicitly 

'Department of Mathematics and Computer Science, University College London, (w.shaw@ucl.ac.uk). 

1 More recent work by Platen and co-workers [2] finds the T or Variance Gamma as an MLE amongst the hyperbolic 
distributional family. My own preliminary studies also find Johnson-SU distributions as a contender, with all 3 types 
many orders of magnitude more likely than Gaussian on equity index and asset log-returns 
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differentiated, opens the probiem up to implementation on a wide variety of off-the-shelf optimizers. 
All of these studies and many others not cited here are attempting to bring greater diversity for 
choices of risk function to the pioneering work by Markowitz and collaborators, e.g. [8, 9, 10J, and 
to build in explicit management of a variety of asset return distributions by functions beyond those 
that may be expressed simply in terms of the first two moments. 

The purpose of this note is not to engage in the already vigorous debate about VaR and CVaR 
as risk measures. There is a serious conceptual difficulty in estimating the probability of extreme 
events when there is so little data available. However, the view taken here is that several statistical 
studies over many decades based on the data that is available point to the Student T as a sensible 
model, and one that is many orders of magnitude more likely than a Gaussian model. In this 
sense the estimate of tail probabilities on such a basis is doing the best one can given the data, 
and is certainly more scientific than clinging to a Gaussian prescription and further indulging the 
pretence that unpredictable "tsunami" -type events generate all the more extreme events. A 25a 
event is roughly 10 130 times more likely in a T4 model (as found in [4j) than it is in a Gaussian. 
There certainly are isolated severe shocks but the routine is also highly non-normal and capable of 
extreme movements. 



2 Recap of Gaussian VaR and CVaR 

Let 4>{x) = (2it)- l l 2 e- x2 / 2 denote the PDF of the standard zero-mean and unit-variance normal 
distribution, and let &(x) denote the corresponding CDF with <J?' = <p. The Gaussian quantile 
function Qg(u) is the function satisfying the condition 

$(Q G (u)) = u (1) 

for all u : < u < 1. This function is not available in simple closed- form, nor is its applied 
mathematics counterpart the inverse error function. However, its properties are well understood 
through the development of many numerical approximations |18| Q] and more recently through its 
characterization as the solution of the non-linear ODE |15| 

Q" = Q(Q') 2 (2) 

with boundary conditions Q(l/2) = 0, Q'{l/2) = V2tt. 

In the case of a Gaussian distribution with mean /x and variance a 2 , the quantile function of 
that distribution is then \x + oQq{u) and the corresponding "Value at Risk", or VaR, is just the 
negative of the quantile, usually expressed in terms of a percentage number w = lOOu. That is 

VaR(w) = -fj, - o-Q G (w/100) (3) 

In what follows we will work in terms of the u-parametrization, with the function 

VaR(u) = -n- o-Q G (u) (4) 

So for example, the 2.5% VaR is given by 

- n - crQ G (0.025) n+ 1.96a (5) 

and other choices of percentile lead to different weightings of the standard deviation. 

The CVaR is a related entity, giving the average in a tail bounded by the corresponding VaR 
level . In general, for a density function f(x) linked to a random variable X, we would need to 
work out 

1 rQc(u) 

E[X\X < Qg(u)} = - / xf(x)dx . (6) 

U J -00 
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In the case f = 4>, the integration may be done trivially, because <\J = —x<f), so that for a standard 
normal distribution 

E[X\X< Q G (u)] = ~d>(Q G (u)) (7) 
and applying a scaling and translation to get the general case gives us, for the loss, 

CVaR(u) = —fj, + %(Q G (u)) (8) 

In the case of a Gaussian system, there is no substantive difference between the VaR and CVaR 
measures: both take the form 

Risk(u) = -\i + ip(u)o (9) 

where ■ipcvi'u) = —Qg{u) m the case of VaR and ipGc{u) = ^4>(Qg(u)) in the case of CVaR. And 
the numerical values are very similar. The numbers in Table 1 illustrate the impotence of switching 



quantile 

u 


-Qg(u) 


i>GC 

<P(Q G (u))/u 


Time scale 
(approx) 


0.100 


1.28155 


1.75498 


Fortnight 


0.025 


1.95996 


2.33780 


Two months 


0.010 


2.32635 


2.66521 


5 months 


10~ 4 


3.71902 


3.95848 


40Y 


10" 6 


4.75342 


4.94833 


4000Y 



Table 1: a- multipliers for Gaussian VaR and CVaR 

from VaR to CVaR while remaining in a Gaussian framework. There is essentially no change in the 
practical outcome of the risk computation, despite the improved mathematical properties of CVaR 
with regard to coherence. One presumes this effect is well-known. 

2.1 Recap of Gaussian VaR and CVaR Portfolio Optimization 

If the risk is now associated with the returns on a portfolio where the returns on TV" assets follow 
a multivariate Gaussian distribution, then if the asset weights are Wi, i = 1, . . . , N, the expected 
returns are /Xj, and the covariance is Cy, then the risk function may now be written as 



Risk(u, Wi) = - ^2 + ^( n ) , c i3 w i w i ( 10 ) 
i=i y i,j 

So both the VaR and CVaR optimizations are of the form of a mean-standard deviation optimization 
where the function ip characterizes the risk-aversity. This equation may be written down without 
further discussion because if Xj has a multivariate normal distribution then YliLi w i^i nas a 
univariate normal distribution for any collection of weights Wi. Supplemental constraints may be 
written down in the normal way - for a long-only fully-invested portfolio we would have the simple 
set 

N 

wi>o, Y, wi = 1 ■ ( n ) 

1=1 

and in general we might relax these to allow short-selling and/or add further matrix conditions or 
weight inequalities. 
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Even in the Gaussian case we also see that optimizing VaR and optimizing CVaR are different 
computations and wil have differing optimal weights, except in the case where the mean may be 
ignored, for example in the limit u — > 0, when the problem is pure variance minimization in either 
case. 

It should be clear that in a real- world context where such a Gaussian model might, where it 
is appropriate at all, be more properly applied to the logs of asset returns, we are assuming that 
the time period of interest is short enough such that the approximation e R ~ 1 + R may be used. 
This will also apply to the T model to be developed next. While this limits the general practical 
usefulness of Gaussian or Student T assumptions within a detailed optimization, it does not diminish 
the usefulness of a semi-analytical model for checking more general numerical techniques, and nor 
does it diminish the importance of getting a quantitative appreciation of the impact of fat tails on 
both the risk measures and the optimizations. 



3 VaR/CVaR for non-Gaussian models 

It will be clear from the discussion above that having a simple analytical model of the risk and 
portfolio optimization problems, at least of the specific type given previously, relies on the following 
properties in order to have a degree of tractability: 

1. Having a finite variance; 

2. Linear closure of the distributional family for the returns; 

3. Having a formula for the quantile (for the VaR); 

4. Being able to integrate x * f{x) (for the CVaR). 

We will now elaborate on the satisfaction of these tractability conditions for the Student T. This is 
really just a matter of assembling known facts and characterizing the distribution in the right way. 

3.1 Tractability of the Student T risk 

Student's T distribution was introduced by William Sealy Gosset in 1908 [16] in connection with 
the statistics of small samples. In Gosset's original construction, expressed in a modern notation, 
the T random variable is constructed from n + 1 (integer) standard independent normal random 
variables Zq, Z\, . . . , Z n as 



W^r 1 ' 2 ( 12 ) 



T = Z 

n 

k=l 

Rewriting this in terms of a chi-squared denominator of degree n - the 'degrees of freedom', we 
have 

T = Zo(^)' 1/2 (13) 

and the condition that n be an integer can then be relaxed, and we can consider real degrees of 
freedom in terms of a positive real variable v. 

T = Z {\l)- 1 ' 2 (14) 

The density of T may be derived by conditioning on the possible values of the denominator, and 
an elementary computation (see e.g. |12j ) leads to the probability density function 

1 T[(u + l)/2] 1 
n ^ V} ~^ I>/2] (l + t7^+D/2 UDj 
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When v > the variance of this distribution exists provide v > 2 and is then 

V(T) = (16) 

In the multivariate case there are many possible extensions of the T to consider - see e.g. Shaw 
and Lee [14J for an extensive discussion of the bivariate case, allowing for independent or non- 
independent marginals, and differing degrees of freedom in the marginals. However, for tractability 
of the risk problem we appear to be limited to the "standard" multivariate T where the Tj are 
given b£] 

N 

T t = (xl/v)- 1/2 ^2A tk Z k . (17) 

k=l 

for iV independent standard Gaussian random variables Zj and a mixing matrix Ay . The covariance 
of this system is then just 

E WA = VZ~ 2 {A - AT)l i (18) 
and if we have weights Wi , the random variable 

TV 

f = Y,WiTi (19) 

8=1 

also has a T-distribution with variance 

V{T) = -^Y^ Wi (A.A T ) ijWj (20) 

ij 

So in this case we manifestly have the requisite linear closure and simple expressions for the variance. 



3.2 Do we need finite variance? 

Taleb |17) has argued that "finiteness of variance is irrelevant in the practice of quantitative finance" , 
in a paper so titled. I should point out that I mostly agree with this. The Student T distribution 
is perfectly well defined in the domain 1 < v < 2 where the mean exists but the variance does not. 
In the domain 1 < v < 4 the kurtosis is already theoretically infinite and there is no reason of 
principle why we cannot consider infinite variance within this family either. What is pathological 
about the case v < 2 is that the paramatrization adopted here and the reduction to a mean-sd 
optimization will break down. We need a suitable L 1 model of the dispersion to employ and in 
terms of which to express the VaR and CVaR. This will be explored elsewhere, and for this paper 
we will remain in the conservative domain where the underlying variance exists. 

Within this conservative domain the real issue is whether we have a meaningful and sensible 
positive definite covariance, and all the usual caveats apply. Our historical estimates may be poor 
for the future for both volatilities and correlations, and over-writing such data with estimates from 
e.g. options or traders is fraught with conceptual and technical problems. The use of a what-if 
approach with correlations, or dealing with many dimensions, may easily produce non positive- 
definite structures, and this needs to be properly analysed and managed. 

2 If the reader wonders why the usual multivariate density is not being given, the reason is that the covariance and 
linear closure properties are much more transparent in this conditionally Gaussian form, which arises in simulation 
of the distribution and the associated copula. 
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3.3 VaR and CVaR in the T model 

In this case we are able to follow a similar pattern to that defined for the Gaussian case, except that 
we need to be careful with the variance normalization and manage the quantile structure. Suppose 
for now that we can find an expression for the quantile function Qt(u, v) for a standard univariate 
T distribution. If we wish to parametrize the problem again by mean and standard deviation, the 
VaR will be given by 

VaR(n) = —fj, — a ^^-^Q T ( u , v) (21) 

so that the relevant ip function for this case is not the (negative of the) standard T quantile but 
the negative of the scaled unit variance T quantil<@Q: 



^tv{u) = -Qt{u, v) = -J ^^Q T {u, v) (22) 

The VaR measure of risk is once again in standard form —\i + ipTv{ u )&- 

For the CVaR we no longer have an identity as simple as </>' = — xcj) that allowed the immediate 
integration of the Gaussian expressions, but it is a simple matter to find the integral of —th(t, v), 
since the density just contains powers of a linear function of t 2 . On doing some elementary inte- 
gration we find that the function 

v ) = V 2 J- K ,„, ' 23 

2v^r(|) 

is the correct expression, and the CVaR for a distribution with mean fj, and variance a 2 is now 



CVaR(u) = -fjt + aj- — -k(Q T (u,u),u)- (24) 

v u 



so that the ^-function is now 



r„-Ut ) J^-^k(Q T {u,i;),v)^ (25) 



Apart from the characterization of the Student T quantile function, we now have explicit expressions 
for VaR and CVaR for this distribution that are comparable with those that are already clear for 
the Gaussian case. 



4 Characterizing the Student T quantile 

The inverse CDF, of quantile function, for the Student T was firstQ studied by Hill in his short 
and beautiful 1970 paper [5]. I revisited the matter in [12] and exhibited the simple closed forms 
available for v = 1, 2, 4, with the two simple even cases arising as the available characterization as 
the solution for even v in terms of a polynomial of degree v — 1. The case v = 1 has been known 
for a long time as it is the Cauchy distribution quantile: 

Q(u,l) =tan[7r(it- 1/2)] (26) 

3 This would of course require modification to extend to the domain 1 < v < 2. 
4 So far as this author is aware. 
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The case v = 2 was known to Hill and takes the form, in our notation, 

^ 2 ) = ^r^ (27) 

y2u(l — u) 

Of course, neither of these older forms have finite variance. The only simple closed-form case with 
this property was derived in [12] for v = 4 and is given by 

a = 4u(l — u) 

g = -^cos { ^cos -1 (\/a) ) (28) 



Q(u,4) =sgn(n-l/2) v /^4 
As usual sgn(x) is +1 if x > and — 1 if x < 0. In this case the relevant unit variance function is 

^tv(u) = -^=Q(u,A) (29) 
4.1 Inverse Beta Function representation 

The CDF for general positive real v may be written in terms of /3-functions: 

F„( a? ) = i(l + a ga(x)(l-l w (| > i)) (30) 

giving an expression in terms of regularized /3-functions. The regularized beta function I x (a,b) 
employed here is given by 

B x (a,b) 

Ua > b) = ^bJ (31) 
where B(a,b) is the ordinary /3- function and B x (a, b) is the incomplete form 

B x (a,b)= [ X t^Xl-t^-^dt (32) 
Jo 

Having got such a representation, this may be formally inverted to give the formula for the quantile: 



()[ a. //) - sgn ( u - - 



\ \ J If[ U <i,2«,2(l-«)] (2' 2) 



This last result is useful in computer environments where representations of inverse beta functions 
are available. If only the (forward) ^-function is available, one can make the inversion by employing 
a stepping or bisection approach. For checking we give a Mathematica implementation in the form: 

Q[u_, nj := Module[{arg = If [u < 1/2, 2 u, 2 (1 - u)]>, 

Sign[u - 1/2] Sqrt[n*(l/InverseBetaRegularized[arg, n/2, 1/2] - 1)]] 

This is not fast enough for live Monte Carlo simulation by the inverse transform method but 
gives high-precision quantile numbers. These need to be multiplied by —\J (u — 2)/u to obtain the 
unit-variance loss ^-function. 
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4.2 Tail Series 

While for Monte Carlo simulation by inversion one needs rapidly-computable expressions for the 
entire interval < u < 1, with identified single- or double-precision accuracy, for our purposes here 
we just need a good approximation for the region < u < 0.025 in order to evaluate VaR/CVaR 
values at the 2.5% or less frequency levels. The following tail series is good enough to produce 
precision of better than 10 -3 for 2 < v < 11 and < u < 0.025: 

w = (u vy/*V[v/2]/T[{y + l)/2]f ,u 

6 

P = Y,d k w k (34) 
fc=l 

Q(u,u) = - v / u*(l/(3-l) 

The coefficients dk are given in Appendix A. The series is exact for u = 2 and the precision for 
2 < v < 4 is better than 10~ 5 . The derivation of this and the associated recurrence for making 
more terms is given in Section 3.4 of |15| . The final normalization to unit variance is also required 
to obtain tprv- 

4.3 Sample numbers 

Practitioners will perhaps find sample values of the loss functions ip more illuminating, and such 
precomputed values may of course be fed into an off-the-shelf optimizer. In Table 2 we report the 
zero mean loss functions ip as a multiple of the standard deviation. The table has some striking 
numbers in it. First note the general similarity of the levels of the 2.5% values for either VaR or 
CVaR for all the distributions - most are in the 2 — 3<r range. One can even see that the numbers 
decrease at very low powers. This is an odd effect but is correct due to the shape of the density. 
A casual analysis of losses within such 40-day windows might well lead one to conclude that a 
Gaussian model is enough or even conservative. The T4 2.5% quantile is almost exactly the same 
as the Gaussian value! 

But in the tails, for the values of v suggested by both the power-law tails and MLE estimation, 
there is serious danger of movements of more than 16<r, peaking at a 28.5<r average loss in events 
happening less than once every 40 years. 

4.4 Excel limitations 

The generation of such quantiles in major modelling environments ought to be a matter of routine, 
but the Excel program still continues to exhibit some awkwardness, even within Excel 2011. I 
itemized the problems with the older TINV function in a supplement to [T2] , still available at 

http : //www . mth . kcl . ac . uk/~shaww/web_page /paper s/Tsupp/TINV . pdf . 

In more recent versions of Excel, there is the better and new function T.INV which at least 
makes some direct computations of the quantiles in the left tail more clearly accessible. If you 
try, for example, T.INV(u,n) and normalize to unit variance you will get answers corresponding 
to those in table above. However, non-integer degrees of freedom are just not managed. If you 
set v = 3.9 you will get the results for v = 3. To be quite clear, Microsoft's online documenation 
for T.INV makes it clear that only integer degrees of freedom are managed. Associated with the 
variable 



deg_f reedom 
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Quantile Measures 


DOF 


Time ~ 

u = 


2M 
0.025 


5M 
0.01 


4Y 

10~ 3 


40Y 
10" 4 


V = oo 
(Gauss) 


'VLr V art 
i>GCVaR 


1.96 
2.34 


2.33 
2.67 


3.09 
3.37 


3.72 
3.96 


v = 6 


*r 1 V CLJX 

i^TCVaR 


2.00 
2.66 


2.57 
3.29 


4.25 
5.24 


6.55 
7.95 


v = 5 


IpTCVaR 


1.99 
2.73 


2.61 
3.45 


4.57 
5.82 


7.50 
9.44 


v = 4 


T J- V tilt 

IpTCVaR 


1.96 
2.82 


2.65 
3.69 


5.07 
6.85 


9.22 
12.3 


i/ = 3 


IpTVaR 
IpTCVaR 


1.84 
2.91 


2.62 
4.04 


5.90 
8.90 


12.8 
19.3 


i/ = 2.5 


IpTVaR 
IpTCVaR 


1.60 
2.78 


2.39 
4.07 


6.18 
10.33 


15.59 
26.00 


v = 2.25 


IpTVaR 
IpTCVaR 


1.29 
2.40 


2.00 
3.65 


5.68 
10.25 


15.85 
28.54 



Table 2: a- multipliers for Student T VaR and CVaR loss functions 

there is the following remark in the online documentation for Excel 2011 Mac edition: "If this 
argument contains a decimal value, this function ignores the numbers to the right side of the 
decimal point." So far as I can determine, quantiles for strictly integer degrees of freedom are 
computed well and deep into the tail. This is of course sufficient within the use of the T as a small 
sample statistic, where the degrees of freedom is an integer, but does not go far enough for our 
purposes, where non-integer degrees of freedom are commonplace. 

5 Risk visualization 

The functions iPdr(u) for D € {G, T} and R £ {V, C} characterize the loss function for the 
standardized zero- mean unit-variance case of Gaussian/T and VaR/CVaR. It it is important to do 
the unit-variance normalization for the T as otherwise in making comparisons one is not working 
on a like-with-like basis. A comparison of the ^-functions isolates the effect of the distributional 
choice on the loss. There may of course be additional effects due to a shift in the mean and an 
explosion in the variance, but we can now assess the effect on our perception of risk of changing 
the distribution alone to one that is much more realistic. At the same time we can parametrize the 
effect by the variable u, which gives us the event frequency. 

In Figure 1 we show a screen shot from a dynamic visualization^] of the VaR and CVaR as a 
function of the power-law in the tails, modelled as a T distribution. The blue curves are Gaussian, 
green a T with v = 4, and red a T with v = 2.25. There are two curves of each colour, with the 
upper curve being the CVaR. The loss is given in multiples of the standard deviation, so the curves 
are our ^-functions. The horizontal axis gives the logarithmic frequency. Note that in the most 
extreme case we can get to 30<7 events on sensible time-scales, whereas in a Gaussian model even 

5 These were first shown by the author at the NERC and the Knowledge Transfer Networks for Industrial Mathe- 
matics and for Financial Services workshop on Uncertainty and Risk Visualisation, Lloyd's of London, May 2010. 
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a 5cr event needs around 100 centuries. 



> 
Q 

00 



Daily CVaR vs VaR: 
Gaussian[B] vs T_4[G], T_2.25[R] 




-Log[10, freq] 

Figure 1: Loss functions for Gauss (blue), T2.25 (red), T4 (green): CVaR (upper) & VaR (lower) 



6 Portfolio Optimization 

The VaR and CVaR in any of these models is given by a risk function of the form 



N 



Risk(u, Wj) = - fiiWi + ip(u) /y] CijWiWj 



(35) 



8=1 



All that has changed by the introduction of the Student T model is the setting of a new scale for 
the a-weighting, which can now be dramatically higher when one worries about lower-frequency 
events. The values for some cases of interest can just be read off Table 2. By way of illustration, 
here are some sample problems: 

Problem 1 Optimize Gaussian VaR at at one in 40 day level: Minimize 



N 



i=l y i,j 

Problem 2 Optimize Gaussian CVaR at one in 40 year level: Minimize 

N , 



(36) 



y~] fJ-iWi + 3.96 / y] CijWiWj 



(37) 
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Note how in problems 1 and 2 the simultaneous switch to CVaR and switching days to years has 
only doubled the cr-weighting. On the other hand... 

Problem 3 Optimize Student T CVaR at one in 40 year level, with a cubic tail model: Minimize 



N 



fiiWi + 19.3 



ijWiWj 



(38) 



i=i 



'■j 



So it is intuitively clear that these problems are intuitively driving us to increase the risk-aversion. 
Once we have the VaR and CVaR optimizations in this form, they may be fed to a wide variety of 
simple optimizers to find the optimal weights. It is self-evident that the impact of the remodelling 
of the input distribution is to drive the risk-optimal solution closer to one of pure minimum variance 
as ip becomes larger, but all the points are on the same efficient frontier. 



Figure 2: Sampling an efficient frontier with optima for Gaussian VaR (blue) and Student T CVaR 
(red) 

This may easily be illustrated using any simple optimizer. With values of u in the form u = W~ x , 
with x in the range [1,5] in steps of a half, a simple 3D test problem was optimized first under 
Gaussian VaR and then under Student T CVaR with v = 3. The optima are shown in the figure 
in blue for Gaussian VaR and red for T CVaR, and an obvious pattern is obtained with return 
(vertical) plotted against variance (horizontal). The covariance matrix and expected returns are the 
same in both cases. A Mathematica notebook with the calculation details is available on request, 
but Figure 2 tells the obvious story. These results have also been compared with the random 
portfolio optimization method described in [13] and good agreement has been found with a sample 
of portfolio returns of size 100, 000 for larger u - values. 



7 Summary and Comments 

In this paper we have given analytical formulae for the VaR and CVaR of a system where the 
returns are characterized by a Student T distribution. The risks of a given loss measured in 
standard deviations are massively bigger than in the Gaussian case and may be computed simply. 
A switch from VaR to CVaR is in practice vacuous if the system remains Gaussian, but may make 
a large difference with power-law tails in the returns. The problem of optimizing VaR and CVaR is 
reduced to a trivial moment-based optimization and the outcome is to make a simple enhancement 
of the weighting of the variance component. Optima in a fat-tailed world are explicitly driven 
towards the pure minimum variance configuration. 
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None of these results should be used to infer that better investigation of tail statistics need not 
be done. Here maximum-likelihood estimates of historical distributions are being extrapolated into 
statements about tails, and all the usual caveats apply. 

We might ask whether this kind of analysis can be done fore more general distributions? For 
systems with finite variance, we can always apply a scaling and translation of the quantile function 
such that, for the VaR of a portfolio, 

VaR(u) = fj, + aQo(u, a, /?,... ) (39) 

where Q$ is a function of other parameters a,(3,. . . beyond the mean and variance, and satisfies 

Q (u, a, $,...) = , [ Q 2 (u,aJ,...) = 1 . (40) 

J o 

The problem in general is that a, f3, . . . will themselves be functions of the weights. In the case of 
the Student T in standard multivariate form we have homogenity of the system in that the same 
value of the single extra parameter v applies to all the assets and to any portfolio built from them 
by linear combination. This is what allows us to work with such a system using simple analytics. 
Similar constructions could be done with any conditionally Gaussian form with a homogeneous 
randomization of the variance. 

The determination of v is something that is probably best done, at least initially, by maximum 
likelihood estimation. The temptation to use a simple kurtosis model should be resisted. This is 
for the simple reason that the total kurtosis is given by 

Hot = 3 + (41) 

v — 4 

and is undefined for v < 4, which is a domain of considerable interest given the data studies. 

Finally we note that is likely that other risk functions will admit a similar decomposition to 
simple mean-s.d. form for the case of the Student T. 
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Appendix: Tail series coefficients for the T-quantile 

di = 1 



d 2 
d 3 



da 



1 



v + 2 

(v-2)(v + 3) 
2(i/ + 2) 2 (^ + 4) 

(v - 2) (^ 3 + Qu 2 + 2v- 18) (42) 

3(^ + 2) 3 (i/ + 4)(^ + 6) 
{y -2)(v + 5) (6^ 5 + 59z/ + 95z^ 3 - 284i/ 2 - 380u + 576) 

24(^ + 2) 4 (i/ + 4) 2 (u + 6)(u + 8) 
(y - 2){v + 3) (2u 7 + 37u 6 + 192u 5 + 26u 4 - 1430^ 3 - 48i/ 2 + 3576^ - 2400) 
10(i/ + 2) 5 (z^ + 4) 2 (zy + + 8)(i/ + 10) 

The associated recurrence for making more terms is given in Section 3.4 of [15]. Precision of better 
than 10 -9 on the region < u < 0.025 is obtained for 2 < v < 18 with e.g. 40 terms and the 
answers are then double precision quality for v < 6. 



